WEIGHTED REGULARITY LEMMA WITH APPLICATIONS 



BELA CSABA AND ANDRAS PLUHAR 

Abstract. We prove an extension of the Regularity Lemma with vertex and edge 
weights, which can be applied for a large class of sparse graphs. 



1. Introduction 

Let G = G(A, B) be a bipartite graph. For X, Y C A U B let e G (X,Y) denote 
the number of edges with one endpoint in X and the other in Y. We say that the 
(A, £>)-pair is e-regular if 



e G (A',B') e G (A,B) 



< e 



\A'\\B'\ \A\\B 
for every A C A, \A'\ > e\A\ and B' C B, \B'\ > e\B\. 

This definition plays a crucial role in the celebrated Regularity Lemma of Sze- 
meredi, see [HI [9]. Szemeredi invented the lemma for proving his famous theorem 
on arithmetic progression of dense subsets of the natural numbers. The Regularity 
Lemma is a very powerful tool when applied to a dense graph. It has found lots of 
applications in several areas of mathematics and computer science, for applications 
in graph theory see e.g. [7j. However, it does not tell us anything useful when applied 
for a sparse graph (i.e., a graph on n vertices having o(n 2 ) edges). 

There has been significant interest to find widely applicable versions for sparse 
graphs. This turns out to be a very hard task. Kohayakawa [5J proved a sparse 
regularity lemma, and with Rodl and Luczak [6] they applied it for finding arith- 
metic progressions of length 3 in dense subsets of a random set. In their sparse 
regularity lemma dense graphs are substituted by dense subgraphs of a random (or 
quasi-random) graph. Naturally, a new definition of e-regularity was needed, below 
we formulate a slightly different version from theirs: 

Let F(A, B) and G(A, B) be two bipartite graphs such that F C G. We say that 
the (A, 5)-pair is e-regular in F relative to G if 

e F (A',B') e F {A,B) 



e G (A',B') e G (A,B) 



< e 



Key words and phrases. Regularity lemma, sparse graphs. 

The authors were partially supported by the grants OTKA T049398 and OTKA K76099. The 
first author was partially supported by a New Faculty Scholarship Grant of the Office of Sponsored 
Programs at Western Kentucky University. 

1 



2 



CSABA AND PLUHAR 



for every A' C A, B' C B and \A'\ > e\A\, \B'\ > e\B\. It is easy to see that the above 
is a generalization of e-regularity, in the original definition the role of G is played by 
the complete bipartite graph K^.b- in this more general definition F can be a rather 
sparse graph, it only has to be dense relative to G, that is, e(F)/e(G) should be a 
constant. 

In this paper we further generalize the notion of quasi-randomness and e-regularity 
by introducing weighted regularity using vertex and edge weights. This enables us to 
prove a more general, and perhaps more applicable regularity lemma. Let us remark, 
that a new notion, volume regularity was recently used by Alon et al. |2J. As we 
will see later, volume regularity is a special case of weighted regularity. Still, these 
approaches are not fully comparable, since they use different premises for finding the 
regular partition of a graph. In this paper we discuss the Regularity Lemma for dense 
subgraphs of weighted quasi-random graphs, while Alon et al. find a regular partition 
for so called (C, r/)- bounded graphs. 

In the proof we will use the Strong Structure Theorem of Tao [10] , that simplifies 
the proof and allows a shorter exposition. We remark that similar ideas might be 
used to find a regularity lemma for certain sparse hypergraphs as well. 

The structure of the paper is as follows. First we discuss weighted quasi-randomness 
and weighted e-regularity in the second section. In the third section we prove the 
new version of the regularity lemma. Finally, we show some applications in the fourth 
section. 

2. Basic definitions and tools 

Let (3 > and G = (V,E) be a graph on n vertices. Set 5g = e(G)/(^), this 
is the density of G. We define the density of the A, B pair of subsets of V(G) by 
Sg(A, B) = ec(A, B)/(\A\\B\). We say that G is /5-quasi-random if it has the following 
property: If A, B C V(G) such that A n B = and \A\, \B\ > (3n then 

\6 g -6g(A,B)\<P5 g . 

That is, the edges of G are distributed "randomly." In order to formulate our regu- 
larity lemma we have to define quasi-randomness in a more general way, that admits 
weights on vertices and edges. 

For a function w : S — > M + and A C S, w(A) is defined by the usual way, that 
is, w(A) = J2xeA w ( x )- We shall also use the indicator function of the edge set of a 
graph H. 1 H : ( v f } ) -> {0, 1} and l H (x,y) = 1 iff xy e E(H). 

We define the weighted quasi-randomness of a graph G = (V, E) with given weight- 
functions fi : V -> R+ and p : -> R + . For A,BcV let 

p G {A,B):= 1 g(u 1 v)p(u,v). 

uGA,v£B 

In particular, p G (u,v) = 1g{u,v)p(u,v) for u,v E V. 
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Definition 1. A graph G = (V,E) is weighted (3 -quasi-random with weight-functions 
p and p if for any A, B C V(G) such Af]B = and p(A) > (3p(V), p(B) > (3p{V) 
we have 

p G (A,B) PG (V,V) 
p(A)p(B) p(V)p(V) 

Observe that choosing p = 1 and p = 1/5g gives back the first definition of quasi- 
randomness. There is another, weaker notion of quasi-randomness, which we will also 
use. 

Definition 2. Let D > 1 be an absolute constant. A graph G = (V, E) is weighted 
(D, f3)- quasi-random with weight-functions p and p if for any A, B C V(G) such 
AnB = and p{A) > (3p{V), p{B) > (3p{V) we have 

1 Pg(V,V) Pg(A,B) Pg (V,V) 
D p(V)p(V) ~ p(A)p(B) ~ p(V)p(V) ■ 

Clearly, if a graph is /3-quasi-random, then it is (D, (3) -quasi-random unless D is 
very close to one. Now we need to describe the weighted version of relative regularity. 

Definition 3. Let G and F be graphs such that F C G and p, p weight functions 
defined as above. For A, B C V(G) and A fl B = the pair (A, B) in F is p — p- 
weighted e-regular relative to G, or briefly weighted e-regular, if 

p F (A',B') PF (A,B) 
p(A>)p(B>) p{A)p{B) ' 

for every A' C A and B' C B provided that p(A') > ep(A),p(B') > ep(B). Here 

Pf(A,B)= 1f(u,v)p(u,v). 

ueA,v<=B 

Remark. Note that weighted e-regularity is nothing but the well-known e-regularity 
when G = Ka,b and p = 1 and p is chosen to be identically the reciprocal of the 
density of G as before. 

Next we define weighted regular partitions. 

Definition 4. Let G = (V, E) and F C G be graphs, and p and p weight functions. 
F has a weighted e-regular partition relative to G if its vertex set V can be partitioned 
into t + 1 clusters W , W±, . . . , Wt such that 

• P(W ) < ep(V), 

• \p(Wi) - p(Wj)\ < max xeV {p(x)} for every 1 < i, j < £, 

• all but at most e£ 2 of the pairs (Wi, Wj) for 1 < i < j < £ are weighted 
e-regular in F relative to G. 
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In order to show our main result we will use the Strong Structure Theorem of 
Tao, that allows a short exposition. In fact we will closely follow his proof for the 
Regularity Lemma as discussed in [10]. First we have to introduce some definitions. 

Let if be a real finite-dimensional Hilbert space, and let S be a set of basic functions 
or basic structured vectors of H of norm at most 1. The function g G H is (M, .Re- 
structured with the positive integers M, K if one has a decomposition 

9 = ^2 CiSi 

l<i<M 

with Si G S and q G [-K, K] for 1 < i < M. We say that g is /3-pseudorandom for 
some (5 > if | (g, s) | < /3 for all s G S. Then we have the following 

Theorem 1 (Strong Structure Theorem - T. Tao). Let H and S be as above, let e > 0, 
and let J : Z + — > R + be an arbitrary function. Let f G H be such that \\f\\u < 1- Then 
we can find an integer M = Mj^ e and a decomposition f = f str + f ps d + f err where (i) 
fstr is (M, M)- structured, (ii) f ps d is 1/ J{M) -pseudorandom, and (Hi) \\f e rr\\H < 

3. Weighted Regularity Lemma relative to a quasi-random graph G 

First we define the Hilbert space H, and S. We generalize Example 2.3 of [10] to 
weighted graphs. Let G = (V, E) be a /3-quasi-random graph on n vertices with weight 
functions jj and p. Let H be the (™) -dimensional space of functions g : (^) — > R, 
endowed with the inner product 

(#> h } = 7n\ g(u,v)h(u,v)p G (u,v). 

It is useful to normalize the vertex and edge weight functions, we assume that 
fJ,(V) = n and (1,1) = 1. We also assume, that p{v) = o(\V\) for every v G V. 
Observe, that if F C G then ||1f|| < 1. We let S to be the collection of 0,1-valued 
functions ^a,b for A, B C V(G), A PI 5 = 0, where 7a,b(w, f ) = 1 if and only if u G A 
and t> G B. We have the following 

Theorem 2 (Weighted Regularity Lemma). Let D > 1 and /3, e G (0,1), sitc/i £/ia£ 
< /3 £ 1/Z) and /ei L > 1. If G = (V,E) is a weighted (D, 0) -quasi-random 
graph on n vertices with n sufficiently large depending on e and L and F C G, 
then F admits a weighted e-regular partition relative to G into the partition sets 
Wq, Wi, . . . ,Wg such that L < £ < C S) l for some constant C e> L. 

Proof: Let us apply Theorem [T] to the function 1 p with parameters rj and function 
J to be chosen later. We get the decomposition 

1-F fstr fpsd "I - fern 

where f str is (M, M )-structured, f ps d is 1/ J(M)-pseudorandom, and \\f e rr\\ < V with 
■\l Ml, Mi. ■ 
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The function f str is the combination of at most M basic functions: 



fstr = ^2 &klA k 



Kk<M 



where Ak,Bk are subsets of V and jA k ,B k agrees with the indicator function of the 
edges of G in between Au and Bk- Any (Ak, Bk) pair partitions V into at most 4 
subsets. Overall we get a partitioning of V into at most 4 M subsets, we will refer to 
them as atoms. Divide every atom into subsets of total vertex weight L ^ M , except 
possibly one smaller subset. The small subsets will be put into Wq, the others give 
Wi, W 2 , • • • , We, with £ = L+ £ 4 . We refer to the sets Wi for % — 1, . . .£ as clusters. 
If n is sufficiently large then this partitioning is non-trivial. From the construction 
it follows that each Wi is entirely contained within an atom. It is also clear that 
A* (Wo) < en and p(Wi) pa m = 9(|) for every 1 < % < i. 
We have that 

\\ferr\\ 2 = i ^ \f er r(u,v)\ 2 p G (u,v) < if. 

From this and the normalization of p it follows that 

7^ E Pg{ W uWj ) E l/e,(v)|V(^)=0(, 2 ). 



Clearly, 



m ! w x E l/err(w,u)| 2 pG(«,u) = 0(77) 
PGlW, Wj) 



ueWi.veWj 



for all but at most 0(rj£ 2 ) pairs (i,j). If the above is satisfied for a pair then 
we call it a good pair. We will apply the Cauchy-Schwarz inequality. For that let 
a(u,v) = \f err (u,v)\y/p G (u,v) and 6(w,u) = y/p G (u,v), then 



-4/ E ° 2 K U )- 



Since 



we get that 



J2 a*(u,v)=0( y /rj)y/p G (W i ,W j ), 

u£Wi,veWj 



^ ^ I ferr (u,v)\p G (u,v) = 0(y/rj) 



if is a good pair. 
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Assume that is a good pair. From the pseudorandomness of f ps d we have that 



1 



I (fpsd, 1a,b) I - "^J 



< 



1 



J(M) 



fpsd(u,v)p G (u,v) 

ueA,veB 

for every AdWi and B cWj. 

We will show that every good pair is weighted e-regular in F relative to G. Let 
be a good pair, and assume that A C fJ>(A) > ep{Wi) and B C Wj, 
/jl(B) > eii{Wj). If (Wi,Wj) is weighted e-regular, then 



p F (A,B) p F (W t ,W 3 



< e. 



p(A)p(B) n(WMWj) 
Recall that 

Pf(A,B)= ^ l F (u,v)p(u,v)= 1f(u,v)p g (u,v), 

u€A,v€B ueA,v€B 

since F C G. 

Substituting f str + f ps d+ ferr for If it is sufficient to verify the following inequalities. 



and 



(1) 
(2) 

(3) 



J2ueA,veBfstr(u,v)p G (u, 


v) 


Y^ueWuveWi fstr(u, v)p G (u, v) 


p(A)p{B) 






J2ueA,veBf P sd(u,v)p G (u 


v) 


YsueWuveWi fpsd(u, v)p G {u, v) 


p(A)p{B) 






J2u€A,v€B ferr(u,v)p G (u, 


v) 


^ueW^veWj ferr(u, v)p G (u, v) 


p(A)p{B) 




p(wmw 3 ) 



<e/3, 

< e/3 

< e/3. 



For proving (1) recall that f str is constant on W,- L x Wj and (M, M) -structured. 
Since the 7x,y basic functions are 0, 1-valued, we get, that |/ s tr| < M 2 . Moreover, 
G is (D, /3)-quasi-random, where < C e. Therefore, (1) < DM 2 (3 < e/3, since 
(3 « e. 

The proof of (2) goes as follows. The first term is 



p(A)p(B) 



the second is 



fpsd(u,v)p G (u,v) 



n 



\(fpsd, lWi,Wj)\ < 



J(M)/i(^)/i(^)' 
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Noting that fJ>(Wk) = ®(n/£) for k > 1 we get that the sum of the above terms is at 
most 

° 2 ' 1\ e 

1 + < -, 



2 J(M) \ e 2 J 3 



if J(M) » §. 

For (3) first notice that it is upper bounded by 



()(vTl) \^WMW 3 ) + ^(A)ii(B) ) ~ °^'e*r{WMWj) 



We also have that 

PG (w^w 3 ) _ 0{{ 



by the normalization of \x and p and from the fact that G is quasi-random. From this 
it is easy to see that if rj <C £ 6 then (3) is at most e/3. This finishes the proof of the 
theorem. □ 



4. Applications 

In this section we consider a few applications of our main result. First we prove 
that a random graph with widely differing edge probabilities is quasi-random, if none 
of the edge probabilities are too small. In this case the vertex weights will all be 
one, but edges will have different weights. Then we show examples where vertices 
have different weights. We will also consider the relation of weighted regularity and 
volume regularity. 

4.1. Quasi-randomness in the G(n,pij) model. In this section we will prove that 
random graphs of the G(n,Pif) model are quasi-random in the strong sense with high 
probability. A special case of this model is the well-known G(n,p) model for random 
graphs. A Regularity Lemma for this case was first applied by Kohayakawa, Luczak 
and Rodl in [6j. They studied G(n,p) for p = c/^fn in order to find arithmetic 
progressions of length three in dense subsets of random subsets of [Nj. 

The G(n,pij) model was first considered by Bollobas jl]. Recently it was also 
studied by Chung et al. []]. 

In this model one takes n vertices, and draws an edge between the vertices Xi and Xj 
with probability pij, randomly and independently of each other. Note that if Pij = p, 
then we get back the well-known G(n,p) model. It is a straightforward application 
of the Chernoff bound that a random graph G G G(n,p) is quasi-random with high 
probability. However, the case of G(n,pij) is somewhat harder. 

Lemma 3. Let (5 > 0. There exists a K = K{(3) such that if G £ G(n,pij) and 
Pij > K/n for every i and j , then G is weighted f3 -quasi-random with probability at 
least 1 — 2 _n if n is sufficiently large. 
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Proof. First of all let n = \, and let p(i,j) = l/p iyj . Set K = 4800//3 6 . Let p = K/n, 
and let p k = e k po for 1 < k < logn. Let A and B be a pair of disjoint sets, both of 
size at least j3n. We partition the pairs (u, v), where u G A and v £ B, into O(logn) 
disjoint sets Hi,H 2 , . . . , Hi : if pk < p uv < Pk+i then (u,v) will belong to H^. Let 
dk = jQ\/e k Kn. We will denote \Hk\ by m^. 

We will prove that the following inequality holds with probability at least 1 — 2~ 3n : 



EX UV 
Puv\A\\B\ 



</3/2, 



where X U1 , is a random variable which is 1 if uv 6 E(G), otherwise it is 0. This implies 
the quasi-randomness of G since there are less than 2 2n pairs of disjoint subsets of 
V(G). Observe that 

UdA,VdB 11 1 

Applying the large deviation inequalities A. 1.11 and A. 1.13 from |BJ, we are able 
to bound the number of edges in between A and B for the edges of Hk in case rrik is 
sufficiently large as follows. According to A. 1.11 we have that 

Pr (X uv -EX uv )>a k \ <e 2 ^ 

\(u,v)eH k J 

where 

^ sr~^ Puv „ 

Pk<qk= > — < Pk+i- 

(u,v)<=H k 



We estimate the exponent in case = n 



2. 




- <~(—) - ——— ! ——— 1 — - < -3//. 



2g fc m fc 2g£m£ 200 

where we used the definition of K. For mj, being much less than n 2 direct substitution 
gives a useless bound. For this case we have the useful inequality 



2 

,i=l 

where Pr(Fj = 1 — q^) = q k and Pr(YJ = — = 1 — qk- This implies that the exponent 
is at most — 3n even in case rrik < n 2 . 

2 

Indeed, let A,B and C be the events that > a fc, XT=i ^ > a ^/^ and 

2 

^r=m fe+1 ^ < — fl fc/2, respectively. Clearly A and C are independent, and AdC C 5. 
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So we have Pr(£) > Pi(AnC) = Pr(A)Pr((7), that is Pr(A) < Pr(.B)/Pr(C7) < 
Pr(B)/2, since by A1.13 



Pr j 53 ^ < ~~ I < e 89fc( " 2 ~ mfc) < -• 



i=rrt fc+1 



With this we have proved that the sum of the weights of the edges of H k will not be 
much larger than their expectation with high probability. 

Now we estimate the probability that the sum of the weights is much less than 
their expectation. Let us use A. 1.13 again directly to the sums over H k s: 



Pr [ 53 ( X ™ ~ EX ™) < ~ a k ] < e 
(u,v)eH k 



2q k m k 



The exponent in the inequality can be estimated very similarly as before: 



2q k m k 



< 



(3 6 ( 2 \ Kir 



200 



em k 



< — 3n, 



moreover, this bound applies for an arbitrary m k . 
Putting these together we have that 



Pr 



53 (Xuv ~ 

(u,v)eH k 



> a k \ < 2 



-3n 



This implies that 



EX UV — ¥,X UV 
Puv\A\\B\ 

(u,v)eH k 



< 



EX UV MX UV 
Pk~AA\\B\ 

(u,v)eH k 



< 



< 



P ( 1 



p k -i\A\\B\ ~ 10 \yfl) ' 



where the last two inequalities hold with probability at least 1 — 2 3n for a given pair 
of sets A and B if n is sufficiently large. Since 



E 

uGA,vGB 



Puv\A\\B\ 



log n 

E E 

k=l (u,v)eH k 



Puv\A\\B\ 



log n 

^E 

k=i 



E 

(u,v)£H k 



X„ 



Pk-i\A\\B\ 



and 



logn . 

53 Tn ( 77 



< 



1 



^ 10 V \/e/ ~ 2' 

fc=l w / 

the claimed bound follows with high probability. 



□ 
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Remark. It is very similar to prove that with high probability | £\ . pa(i,j) — (™) I — 
o(n), we omit the details. From this it follows that rescaling the above edge weights 
by a factor of (l+o(l)) and letting p = 1 provides us /3-quasi-random weights for most 
graphs from G(n,pij) such that p{V) = n and pciY, V) = 2("). That is, with high 
probability we can apply the Regularity Lemma for any F C G, where G G G(n,Pij). 

4.2. Other examples for defining vertex and edge weights. When defining the 
notion of weighted quasi-randomness and weighted regularity, we mentioned, that 
choosing p = 1 and p = 1/5q gives back the old definitions of quasi-randomness and 
regularity. In the previous section we saw an example when we needed different edge 
weights, but p was identically one. 

Let us consider a simple example in which p has to take more than one value. Let 
G be a star on n vertices, that is, the vertex v\ is adjacent to the vertices v 2 , . . . , v n , 
and Vi has degree 1 for i > 2. We let p{v\) = 1/2 and p{vj) = l/(2(n — 1)) for 
i > 2, and choose pc = n/2. With these choices G is easily seen to be quasi-random, 
moreover, it is weighted regular. 

A more sophisticated example relates weighted regularity with volume regularity, 
the latter introduced by Alon et al [2J. Given a graph G, the volume of a set S C V(G) 
is defined as vol(S) = J2 v es deg(u). For disjoint sets A,Bc V(G), the (A, B) pair is e- 
volume regular if VX C A, Y C B satisfying vol(A) > e-vo\(A) and vol(y) > e-vol(fi) 
we have 



<e-vol(A)vol(S)/vol(y). 



In order to show that volume regularity is a special case of weighted regularity, we 
may choose edge and vertex weights as follows: for all A,BcV let 

p(A ' B) = e{A ' B) ^MV) 

and 

, .. n ■ vol (A) 
vol(VJ 

It is easy to see that if the (A, B) pair is p — p-weighted e-regular with the above 
choice for p and p, then it is also e-volume regular. 

In certain cases it can be more useful to apply the notion of weighted regularity 
than volume regularity. As an example, consider a bipartite graph G(A,B), where 
A = A 1 U A 2 , B = B x U B 2 and A x n A 2 = B 1 n B 2 = 0. We assume that \A X \ = 
\A 2 \ = \Bi\ = \B 2 \ = n/4. Further assume that G(A\,B) and G(A, B\) are complete 
bipartite graphs, and G(A 2 , B 2 ) is a random bipartite graph with edge probability 

Letting X = A 2 and Y = B 2 , it is easy to see that G(A, B) is not e-volume regular, 
if e is small, no matter how large n is. On the other hand, the result of the previous 
section implies that G is weighted regular, even with a very small e. 
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